This work addresses the challenge of video semantic segmentation, a critical component of applications such as autonomous driving. The primary aim was to explore the role of temporal awareness in video sequences and its impact on split computing. To this end, we analyzed existing deep neural networks for semantic segmentation and their computational demands, and we propose a split computing architecture that leverages high-accuracy segmentation results from a remote server to enhance performance on mobile devices. To validate the approach, we developed and tested four U-Net modifications on the CamVid dataset. Our results demonstrate that incorporating segmentation masks from previous frames significantly improves accuracy in split computing scenarios. In particular, masks warped using optical flow yielded the best results, increasing segmentation accuracy from 81.1% to 84.1% with minimal additional computational cost. These findings highlight the potential of time-aware split computing to enhance video semantic segmentation performance in resource-constrained IoT environments.
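The best-performing variant feeds the previous frame's mask, warped by optical flow, into the current prediction. As a minimal sketch of what such warping involves (the `warp_mask` helper, the backward-flow convention, and nearest-neighbor sampling are illustrative assumptions, not details from the paper):

```python
import numpy as np

def warp_mask(prev_mask: np.ndarray, flow: np.ndarray) -> np.ndarray:
    """Warp a previous-frame segmentation mask to the current frame.

    Assumes backward flow: flow[y, x] gives the displacement from a
    current-frame pixel back to its location in the previous frame.
    Nearest-neighbor sampling keeps the output a valid label map.
    """
    h, w = prev_mask.shape
    ys, xs = np.mgrid[0:h, 0:w]
    # Source coordinates in the previous frame, clamped to the image bounds.
    src_x = np.clip(np.round(xs + flow[..., 0]).astype(int), 0, w - 1)
    src_y = np.clip(np.round(ys + flow[..., 1]).astype(int), 0, h - 1)
    return prev_mask[src_y, src_x]

# Toy example: a labeled column shifted one pixel to the right by the flow.
mask = np.zeros((4, 4), dtype=np.int64)
mask[:, 1] = 1
flow = np.zeros((4, 4, 2), dtype=np.float32)
flow[..., 0] = -1.0  # each current pixel originates one pixel to its left
warped = warp_mask(mask, flow)
```

In practice the flow field would come from a dedicated estimator (classical or learned), and the warped mask would be concatenated with the current frame as an extra input channel to the segmentation network.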